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Abstract 

We propose a paradigm for concurrent natural 
language generation. In order to represent gram- 
mar rules distributively, we adopt categorial unifi- 
cation grammar (CUG) where each category owns 
its functional type. We augment typed lambda 
calculus with several new combinators, to make 
the order of A-conversions free for partial / local 
processing. The concurrent calculus is modeled 
with Chemical Abstract Machine. We show an ex- 
ample of a Japanese causative auxiliary verb that 
requires a drastic rearrangement of case domina- 
tion. 



2 Extended typed A-calculus 

CUG (Categorial Unification Grammar) [^ is ad- 
vantageous, compared to other phrase structure 
grammars, for parallel architecture, because we 
can regard categories as functional types and we 
can represent grammar rules locally. This means 
that we do not need externally-given grammar 
rules but those rules reside within each word or 
each phrase. In this section, we regard categories 
as polymorphic types and consider the type cal- 
culus. In later sections we denote categories by 
DAG (directed acyclic graph) of PATR grammar 
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1 Introduction 

Parallel and distributed computation is expected 
to be the main stream of information process- 
ing. In the conventional generation, the rules 
for composition are given from the outside and 
those rules control all the behavior of the symbols 
or the objects, for assembling a hierarchical tree 
structure. For example, all the linguistic objects, 
such as words and phrases must be applied to so- 
called grammar rules to form grammatical struc- 
tures or rational semantic representations, under 
a strict controller process. However, this kind of 
formalization obviously contradicts the partial / 
distributed processing that would be required in 
parallel architecture in future. 

In order to represent grammar rules distribu- 
tively, we adopt categorial grammar, where we can 
an attach local grammar rule to each word and 
phrase. What we aim in this paper is to propose 
a paradigm that enables partial / local genera- 
tion through decompositions and reorganizations 
of tentative local structures. 

In the following section, we introduce the ex- 
tended A-calculus. Thereafter we introduce the 
ChAM model and we reinterpret the model in 
terms of natural language processings. Then we 
show the model of membrane interaction model 
with the example of Japanese causative sentence 
that requires drastic change of domination of 
cases. Finally we will discuss the future of the 
model. 



2.1 A-calcuIus of polymorphic type 

We use greek letters, for type schemas. For type 
constants we use o", r, • • • while for type variables 
we use a, /3, • ■ ■. a : a represents that the object a 
is of type a. If a and P are types, then a — > /? is 
a type. 

The purpose of type inference is to infer the 
type of an object from a set of objects whose types 
are known. We presuppose that two type variables 
a and /3 are unified with a unifier 9. We use P for 
this set of type-known objects. The most impor- 
tant two rules are as follows: |j 



P^i U {x : aOi} h t : /3 
P^i h Xx'^.t :a9i^/3 
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re29394 ht-.ae^^ pe^ T929-i94, ^ s-.aOA 

P02^3^4 ^ t{s) : /39i 

The rule (|2|) corresponds to /3-conversion of the 
ordinary A-calculus Q. 

2.2 Extended combinators 

In this subsection, we introduce two combina- 
tors that enable us to change the order of A- 
conversion, proposed by Steedman [^, as a kind of 
type change M. The ordinary A-calculus requires 



^ 62,63 are ioi re2 \- t : a -^ P and for FO^ h s : 
a, respectively. 64, unifies a wliicli appears in both type 
declarations. 



a strict order of conversion. However, in a concur- 
rent model, this kind of strict order is a hindrance 
and contingent conversions are required. 

C-combinator changes the order of A-variables 
as follows: 

C{\xy.f{x,y)) = \yx.f{x,y). 

Another requirement for exchanges of the order of 
A-conversion is the following case. Suppose that 
we are required to compose all the following typed 
objects: 

g : a^ P 
a : a 

In such a case, we need to concatenate g and a 
first, and then g{a) becomes applicable to /. How- 
ever, with the help of the following B-combinator: 

B{Xx.f{x)){Xy.g{y)) = Xx.f{g{x)). 

The A-variable in g can be shifted beyond the 
scope of / so that we can concatenate / and g 
first, and, thus, have a become applicable as in 
Fig. I. 



and semantic roles, as follows: 



>^y-9{y) 
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Xx.f{x) 






Xy-f{9{y)) 



f{9{a)) 
Figure 1: B-combinator 



2.3 Cost of unification 

The repeated use of C- and B-combinators is still 
problematic if we consider implementing it as an 
actual system because the termination of process- 
ing is not guaranteed. We have modeled the pro- 
cess of a partial decomposition as an abstraction 
of an argument of the first-order term. If this ab- 
straction occurs randomly, the process easily falls 
into a loop. In order to avoid this, we assume the 
unification cost. If a compound term (a subtree) 
were to be decomposed once, the element with 
the longer distance should be abstracted first. We 
can regard the whole sentence structure as more 
grammatical if the sum of these unification costs is 
smaller. We introduce the heuristic costs Q , con- 
sidering the parallelism between syntactic cases 



'{nom,agt) I 

n II 

'{dat,cgt) II 

n II 

' {acc,obj) II 



' (obj ,nom) I 

1 II 

'(dat,agt) II 

ht,obj)\\ = 



' {nom,dat) II '^^ 
\agt,cgt)\\ = OO 



where ^(^.j/) represents a unifier of two DAG's: 
one's syntactic case is x and the other's semantic 
role is y. k is some constant larger than 1 (A: > 1). 



3 Chemical Abstract Machine 

Chemical Abstract Machine (ChAM, for short) ||^] 
is a paradigm of concurrent A-calculus. In this pa- 
per, we will mention our principles on natural lan- 
guage processing with regard to the ChAM model. 

We assume the process of natural language 
recognition as follows. Whenever a linguistic ob- 
ject is recognized, it is thrown into the solution 
of ChAM, and acts as a molecule. Verbs and 
some other auxiliary verbs introduces membranes. 
These membranes becomes their scopes for case 
(or role) domination; namely, each verb searches 
for molecules (noun phrases) that are necessary 
to satisfy each verb's case (role) frame, within its 
membrane. In some occasions, if multiple verbs 
exist in one sentence, they may conflict as to which 
verb dominates which noun phrase. In such a case, 
two membranes can interact and can exchange 
some molecules. 

We use si, S2, ^s, • • • for membranes. When a 
membrane s, contains a molecule a, we denote as 
Si \= a. The supporting relation {\=) can be in- 
terpreted as an inclusion relation (d) in this case. 
Two membranes can interact when they contact 
with the notation '||', as si||s2. If there is a float- 
ing molecule (that which is not yet concatenated 
with other molecules) on one side, it can move 
through the porous membranes. Valences for con- 
catenation of each molecule are represented by 
typed ia77i6da-variables. If one membrane con- 
tains only one composite structure, and it still 
has surplus valences, we can regard that whole the 
membrane has those surplus valences as follows. 

S2 1= Xxyz.make{x,y,z) 

i 
Xxyz.S2 \= make{x , y , z) 

Now, we will apply our notions above to the 
actual problem of sentence generation. 



[yom- = read) 




Who reads? 


Ken-wa Naomi-ni hon-wo yom-u. 




Ken 


Ken-wa Naomi-ni hon-wo yom-ase-ru. 




Naomi 


Ken-wa Naomi-ni hon-wo yom-are-ru. 




Naomi 


Ken-wa Naomi-ni hon-wo yom-ase-(r)- 


are-ru. 


Ken 



I -wa : nominative case marker 

-ni : dative case marker 

-wo : accusative case marker 

hon : noun for 'book' 

yom,- : root of verb 'read' 

-ase- : auxiliary verb for causative 

-are- : auxiliary verb for passive 

y -(r)u : present tense marker 



Table 1: Agents alternation by agglutination of auxiliary verbs 



4 Example: Japanese causative 
sentence 

In the Japanese language, the causative and the 
change of voice are realized by agglutinations of 
those auxiliary verbs at the tail of current verbs. 
These auxiliary verbs as well as ordinary verbs 
can dominate some cases so that these agglutina- 
tions may change the whole syntax y]. Namely 
the scope of the operation of these auxiliary verbs 
is not the operated verb but the whole sentence. 
In order to illustrate these role changes, we show 
the alternation of the agent of the main verb in 
Table || with a short tip to Japanese lexicon. 
As an example, we will take the sentence: 

Ken-wa Naomi-ni hon-wo yom-aseru. 
(Ken makes Naomi read the book.) 

First, we give DAG's for each lexical items in 
Fig ^. The last DAG in Fig. ^ represents that the 
verb ^yomu (read)' requires two roles 'the reader' 
and 'the object to be read', and one optional role 
'the counter-agent' who hears what the reader 
reads. In that figure, W \=^ means that each 
word is recognized in the general world however a 
verb ^yomu'' introduced a special membrane si as a 
subworld of W. Each DAG means a polymorphic 
type of the lexical item. 

Assume that there is a parser that constructs 
partial tree structures, as recognizing each word 
from the head sequentially. Then, when the first 
four words are recognized, they can form a com- 
plete sentence of (^). 



si 1= {read{K\g^, N\0^, B\g^) : 



cat S 



} (3) 



made in the subworld of si. In (^, 0j's are the 
records of unification, that contain the costs and 
the original types; they become necessary when 
they are backtracked, and in that meaning, those 
bindings are transitive. 

Now, let us recapitulate what has occurred in 
the membrane si. There were four lexical items in 
the set, and they are duly organized to a sentence 
and si becomes a singleton. 



si = {K : N, N : N, B : N, 

Xxyz.read{x, y,z) : N ^ N 

i 
si = {read{K,N,B)} 



N ^ S} 



Because all the three nouns are adequately con- 
catenated by 'read', a sentential representation is 



Then, the problematic final word ^-aseru 
(causative)' arrives; its DAG representation is as 
in Fig. 1^. The DAG in Fig. ^ requires a sentential 
form (category S) as an argument, and in addi- 
tion, it subcategorizes an item of category A^ as 
an agent of the subsentence. 

Now, the process becomes as in Fig. H. All 
through the process in Fig. H, C- and B- 
combinators are used repeatedly as well as ordi- 
nary type inference (|l|) and (|^) . The second mem- 
brane S2 requires an agent role (the variable x' of 
make). There is a record in 9i that it bit agent, 
so that the comparison should be made between 
9i and 6i{= 0(^k,x'))- However, because both of 
9i and 64, unifies nominative case and agent role, 
the costs are equivalent. In such a case, linguistic 
heuristics will solve the problem. In this case, the 
agent of make should be the nominative of the 
whole sentence, and the co-agent of make is the 
dative of the whole sentence, so that K and A^ are 
bit by newly arrived make. B remains bound to 
read, because there is no A-variable of that type 
in make. The process is depicted in fig. |5[ 



W [= K{= Ken-wa) : 
W \= N{= Naomi-ni) : 
W \= B{= hon-wo) : 



cat N 
case nom 

cat N 
case dat 
cat N 
case ace 



si \= Xxyz.read{x,y, z){= yom-) : 
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Figure 2: Initial DAG 



Xxyz.make{x,y, z{y)) : 
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Figure 3: DAG for ^make' 



si\= read{K\0^^N\0^,B\0..^) \\ Xx'y'z' .S2 \= niake{x' ,y' ,z') 

i 

Xx.si\= read{x,N\e,^,B\e.^) \\ Xy'z' .S2 \= make{K\g^,y' , z') 

i 

Xxy.si \= read{x,y,B) \\ Xz'.S2 \= make{K\g^, N\g-, z') 

i 

si II S2\= make{K\g^,N\g^^,Xy.read{N,y,B)) 



Figure 4: Process 
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Figure 5: Membranes interaction 



5 Conclusion 

Introducing free-ordered typed A-calculus, to- 
gether with the notion of unification costs in 
types, we have shown the structuring of natu- 
ral language syntax, by distributively represented 
types in random orders. We adopted a model of 
Chemical Abstract Machine for the partial/ con- 
current computation model. 

Although we introduced the concept of costs 
and termination was assured, the efficiency of con- 
structing a parsing tree would be far slower than 
sequential processing. However our objective is 
not to propose a faster algorithm, but is to show 
the possibility of distributed processing of natural 
languages. We could show that natural language 
syntax is self-organizable, in that each linguistic 
objects do not need to be poured into 'molds', viz., 
externally given grammar. 
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